Thesaurus-Based Feedback to Support Mixed Search and Browsing Environments

نویسندگان

  • Edgar Meij
  • Maarten de Rijke
چکیده

We propose and evaluate a query expansion mechanism that supports searching and browsing in collections of annotated documents. Based on generative language models, our feedback mechanism uses document-level annotations to bias the generation of expansion terms and to generate browsing suggestions in the form of concepts selected from a controlled vocabulary (as typically used in digital library settings). We provide a detailed formalization of our feedback mechanism and evaluate its effectiveness using the TREC 2006 Genomics track test set. As to the retrieval effectiveness, we find a 20% improvement in mean average precision over a query-likelihood baseline, whilst increasing precision at 10. When we base the parameter estimation and feedback generation of our algorithm on a large corpus, we also find an improvement over state-of-the-art relevance models. The browsing suggestions are assessed along two dimensions: relevancy and specifity. We present an account of per-topic results, which helps understand for what type of queries our feedback mechanism is particularly helpful.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MedTextus: An Ontology-enhanced Medical Portal

In this paper we describe MedTextus, an online medical search portal with dynamic search and browse tools. To search for information, MedTextus lets users request synonyms and related terms specifically tailored to their query. A mapping algorithm dynamically builds the query context based on the UMLS ontology and then selects thesaurus terms that fit this context. Users can add these terms to ...

متن کامل

Navigating the virtual library: A three-dimensional browsing interface for information retrieval

Two broad types of solutions have been proposed: to automatically or semi-automatically refine a query, or to provide the user with aids for constructing queries and dealing with poor search results. Examples of the former approach include: relevance feedback, in which relevance assessments supplied by the user for previously retrieved documents are used to reformulate the query [20]; word stem...

متن کامل

A Search Engine for Browsing the Wikipedia Thesaurus

Wikipedia has become a huge phenomenon on the WWW. As a corpus for knowledge extraction, it has various impressive characteristics such as a huge amount of articles, live updates, a dense link structure, brief link texts and URL identification for concepts. In our previous work, we proposed link structure mining algorithms to extract a huge scale and accurate association thesaurus from Wikipedi...

متن کامل

JIS 28/2 00 prelims

User interfaces to information retrieval systems play a major role in assisting users to search, browse and retrieve information relevant to their needs. This paper provides a review of a category of information retrieval interfaces that are enhanced by incorporating standard thesauri as part of their searching and browsing facilities. A brief account of the rationale behind the integration of ...

متن کامل

Searchling: User-Centered Evaluation Searchling: User-Centered Evaluation of a Visual Thesaurus-Enhanced Interface for Bilingual Libraries

In this paper, we describe a qualitative user study of Searchling – an experimental visual interface that allows users to leverage a bilingual thesaurus for query formulation and enhancement. The design of Searchling is based on theories of thesaurus-based interface design from Shiri et al. [1], combined with the principles of rich-prospect browsing [2]. The Searchling interface provides the us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007